Rank in Wordlist | Frequency | Word |
---|---|---|
2464 | 18 | 2,5 |
3362 | 13 | 1,4 |
3363 | 13 | 1,5 |
4678 | 9 | 1,1 |
4679 | 9 | 1,6 |
5163 | 8 | 0,8 |
5165 | 8 | 1,2 |
5166 | 8 | 1,7 |
5167 | 8 | 1,8 |
5174 | 8 | 4,5 |
Rank in Wordlist | Frequency | Word |
---|---|---|
25080 | 1 | (K)ein bisschen schwanger |
Rank in Wordlist | Frequency | Word |
---|---|---|
25080 | 1 | (K)ein bisschen schwanger |
Rank in Wordlist | Frequency | Word |
---|---|---|
2753 | 16 | 10% |
3138 | 14 | 15% |
3612 | 12 | 20% |
4266 | 10 | 2% |
4269 | 10 | 7% |
4681 | 9 | 100% |
4686 | 9 | 5% |
4687 | 9 | 50% |
5171 | 8 | 25% |
5785 | 7 | 30% |
Rank in Wordlist | Frequency | Word |
---|---|---|
2844 | 16 | S&P |
8378 | 5 | S&P 500 |
10270 | 4 | Roth & Rau |
10417 | 4 | Standard & Poor's |
12575 | 3 | H&M |
18466 | 2 | J&J |
18558 | 2 | Julie & Julia |
19123 | 2 | M&A |
25079 | 1 | & Co |
27473 | 1 | AT&T-Titel |
Rank in Wordlist | Frequency | Word |
---|---|---|
705 | 63 | ." |
Rank in Wordlist | Frequency | Word |
---|---|---|
3019 | 15 | Moody's |
3909 | 11 | 100'000 |
4680 | 9 | 10'000 |
4688 | 9 | 50'000 |
5172 | 8 | 30'000 |
7674 | 5 | 20'000 |
7694 | 5 | 60'000 |
8828 | 5 | gibt's |
9176 | 4 | 1'000 |
9189 | 4 | 150'000 |
Rank in Wordlist | Frequency | Word |
---|---|---|
4764 | 9 | Google + |
18895 | 2 | Kühne + Nagel |
18896 | 2 | Kühne+Nagel |
31150 | 1 | Blick+Bild |
32183 | 1 | CPH Chemie + Papier |
32255 | 1 | Canal + |
32562 | 1 | Chemie + Papier Holding |
34244 | 1 | E+1 |
40481 | 1 | Home+Foyer |
40573 | 1 | Huber+Suhner |
Rank in Wordlist | Frequency | Word |
---|---|---|
6579 | 6 | 2010/11 |
6580 | 6 | 2011/12 |
7454 | 6 | km/h |
9491 | 4 | DEVISEN/Euro |
9544 | 4 | EUROPA/Ausblick |
10768 | 4 | awp/sda |
11580 | 3 | 2008/2009 |
12321 | 3 | FRANKFURT/Ausblick |
12789 | 3 | KEYSTONE/AP |
15807 | 2 | 2012/13 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others are not. If words containing disallowed characters do not have a very low frequency, there may be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
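The same check can be repeated for every character in the list. The following is a minimal sketch, not the tool's actual implementation. It assumes a `words` table with the columns `w_id`, `word` and `freq` used in the query above, and runs against SQLite rather than the original database. The helper names, the `ORDER BY` clause and the `min_freq` threshold of 5 are illustrative assumptions. `instr()` replaces `LIKE` so that `%` and `_` need no escaping.

```python
import sqlite3

# Characters listed in this subsection.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def words_containing(conn, char, limit=10, offset=100):
    """Return (rank, freq, word) rows for words containing `char`.

    Mirrors the query above: word ids up to `offset` are skipped and the
    rank is reported as w_id - offset. ORDER BY is added here (assumption)
    to make the result order deterministic.
    """
    sql = (
        "SELECT w_id - ?, freq, word FROM words "
        "WHERE w_id > ? AND instr(word, ?) > 0 "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (offset, offset, char, limit)).fetchall()


def suspicious_chars(conn, min_freq=5):
    """Map each special character to the words containing it whose
    frequency is at least `min_freq` (hypothetical threshold)."""
    report = {}
    for char in SPECIAL_CHARS:
        rows = words_containing(conn, char, limit=1000)
        hits = [row for row in rows if row[1] >= min_freq]
        if hits:
            report[char] = hits
    return report


if __name__ == "__main__":
    # In-memory stand-in for the corpus database; the three rows are taken
    # from the tables in this section (w_id = rank + 100).
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
    )
    conn.executemany(
        "INSERT INTO words VALUES (?, ?, ?)",
        [
            (2853, "10%", 16),
            (2944, "S&P", 16),
            (25180, "(K)ein bisschen schwanger", 1),
        ],
    )
    for char, hits in suspicious_chars(conn).items():
        print(char, hits)
```

Characters that appear in the report are candidates for a closer look at the tokenizer. Whether a given hit is a real problem depends on the language: `S&P` or `10%` may be acceptable tokens, while a frequent word ending in `."` is more likely a preprocessing error.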